Decompose to Adapt: Cross-Domain Object Detection Via Feature Disentanglement

نویسندگان

چکیده

Recent advances in unsupervised domain adaptation (UDA) techniques have witnessed great success cross-domain computer vision tasks, enhancing the generalization ability of data-driven deep learning architectures by bridging distribution gaps. For UDA-based object detection methods, majority them alleviate bias inducing domain-invariant feature generation via adversarial strategy. However, their discriminators limited classification due to unstable training process. Therefore, extracted features induced cannot be perfectly and still contain domain-private factors, bringing obstacles further discrepancy. To tackle this issue, we design a Domain Disentanglement Faster-RCNN (DDF) eliminate source-specific information for task learning. Our DDF method facilitates disentanglement at global local stages, with Global Triplet (GTD) module an Instance Similarity (ISD) module, respectively. By outperforming state-of-the-art methods on four benchmark UDA our is demonstrated effective wide applicability.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning to decompose for object detection and instance segmentation

Although deep convolutional neural networks(CNNs) have achieved remarkable results on object detection and segmentation, preand post-processing steps such as region proposals and non-maximum suppression(NMS), have been required. These steps result in high computational complexity and sensitivity to hyperparameters, e.g. thresholds for NMS. In this work, we propose a novel end-to-end trainable d...

متن کامل

Cross-Domain Weakly-Supervised Object Detection through Progressive Domain Adaptation

Can we detect common objects in a variety of image domains without instance-level annotations? In this paper, we present a framework for a novel task, cross-domain weakly supervised object detection, which addresses this question. For this paper, we have access to images with instance-level annotations in a source domain (e.g., natural image) and images with image-level annotations in a target ...

متن کامل

Cross-domain Human Parsing via Adversarial Feature and Label Adaptation

Human parsing has been extensively studied recently (Yamaguchi et al. 2012; Xia et al. 2017) due to its wide applications in many important scenarios. Mainstream fashion parsing models (i.e., parsers) focus on parsing the high-resolution and clean images. However, directly applying the parsers trained on benchmarks of high-quality samples to a particular application scenario in the wild, e.g., ...

متن کامل

Few-to-few Cross-domain Object Matching

Cross-domain object matching refers to the task of inferring unknown alignment between objects in two data collections that do not have a shared data representation. In recent years several methods have been proposed for solving the special case that assumes each object is to be paired with exactly one object, resulting in a constrained optimization problem over permutations. A related problem ...

متن کامل

Fisher Discriminant Analysis (FDA), a supervised feature reduction method in seismic object detection

Automatic processes on seismic data using pattern recognition is one of the interesting fields in geophysical data interpretation. One part is the seismic object detection using different supervised classification methods that finally has an output as a probability cube. Object detection process starts with generating a pickset of two classes labeled as object and non-object and then selecting ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Multimedia

سال: 2023

ISSN: ['1520-9210', '1941-0077']

DOI: https://doi.org/10.1109/tmm.2022.3141614